Computation of probabilities for island-driven parsers

نویسندگان

  • A. Corazzat
  • Renato De Mori
  • Roberto Gretter
  • Giorgio Satta
چکیده

(A(x) j n s) is essentially a performance model obtainable by experiments of a system that segments a speech signal into syllables, n s being the number of actual syllables in the sentence corresponding to the signal. Pr(n s j m) is the probability that a string of m words is made up of n s syllables; it can be estimated from a written text. Once the above probabilities have been computed, it is possible to estimate Pr(S < uxvy >) as follows: (14) where Pr(A(x) j m) < " for m m min and m m max. It is also possible to introduce other heuristics, such as weight-ing the latter probability Pr(S < ux (m) vy () >) with some quantities obtained from Pr(A(x) j m). DISCUSSION We think that the presented theory will be eeective when the task at hand has certain characteristics. In particular , a good task should be sentence interpretation in restrict domains, because in this case island predictions can be guided by extrasyntactical sources of knowledge (for example semantic and pragmatic heuristics). In addition, a word spotter should be required to nd the words predicted by the system within the speech signal; the performance of the word spotter is improved by the fact that we only look at a limited number of \higly informative keywords". Our theory accounts for bidirectional expansion of partial analyses; this results in an improvement of the predictive capabilities of the system. In fact, bidirectional strategies can be used in restricting the (stochastic) syntactic search space for gaps surrounded by two partial analyses. This point has been discussed in 7] for the cases of one word length gaps. Notice that our theory generalizes such cases to m-length gaps as well as to case when partial analyses are not partially instantiated context-free productions, but partial derivation trees in all their generality. So far we have discussed how to score a given theory. It is also possible to use the presented framework in predicting words adjacent to an already recognized string, in order to extend the corresponding theory. This can be done by selecting the word(s) which maximize the probability of the theory augmented with it. Instead of computing these probabilities for all symbols in the dictionary, it is possible to restrict such an expensive process to the preterminal symbols (see 2]).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computation of upper-bounds for island-driven stochastic parsers

A theoretical framework has been introduced for the computation of syntactic upper-bounds for partial analyses obtained by an island-driven parser. Several proposals have been advanced in the ASR literature for the use of Alike algorithms, based on the computation of acoustical upper-bounds. Syntactic upper-bounds can be integrated with the acoustical ones to drive the search process toward bot...

متن کامل

Some Results on Stochastic Language Modelling

ISLAND-DRIVEN PARSING Language Modeling and Theories with Is lands Automatic Speech Understanding (ASU) is based on a search process that generates part ia l interpretations of a spoken sentence cared theoeies; theories are scored on the basis of a likellhood L = O(Pr (A I ¢h) Pr( th)) . We are interested in the computat ion of Pr ( th) when th is a part ial interpretat ion of a spoken sentence...

متن کامل

External and Internal Incompressible Viscous Flows Computation using Taylor Series Expansion and Least Square based Lattice Boltzmann Method

The lattice Boltzmann method (LBM) has recently become an alternative and promising computational fluid dynamics approach for simulating complex fluid flows. Despite its enormous success in many practical applications, the standard LBM is restricted to the lattice uniformity in the physical space. This is the main drawback of the standard LBM for flow problems with complex geometry. Several app...

متن کامل

Explaining Heterogeneity in Risk Preferences Using a Finite Mixture Model

This paper studies the effect of the space (distance) between lotteries' outcomes on risk-taking behavior and the shape of estimated utility and probability weighting functions. Previously investigated experimental data shows a significant space effect in the gain domain. As compared to low spaced lotteries, high spaced lotteries are associated with higher risk aversion for high probabilities o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1990